Search | VHL Regional Portal

1.

Domestication has altered gene expression and secondary metabolites in pea seed coat.

Klcová, Barbora; Balarynová, Jana; Trnený, Oldrich; Krejcí, Petra; Cechová, Monika Zajacová; Leonova, Tatiana; Gorbach, Daria; Frolova, Nadezhda; Kysil, Elana; Orlova, Anastasia; Ihling, Christian; Frolov, Andrej; Bednár, Petr; Smýkal, Petr.

Plant J ; 2024 Apr 05.

Article in English | MEDLINE | ID: mdl-38578789

ABSTRACT

The mature seed in legumes consists of an embryo and seed coat. In contrast to knowledge about the embryo, we know relatively little about the seed coat. We analyzed the gene expression during seed development using a panel of cultivated and wild pea genotypes. Gene co-expression analysis identified gene modules related to seed development, dormancy, and domestication. Oxidoreductase genes were found to be important components of developmental and domestication processes. Proteomic and metabolomic analysis revealed that domestication favored proteins involved in photosynthesis and protein metabolism at the expense of seed defense. Seed coats of wild peas were rich in cell wall-bound metabolites and the protective compounds predominated in their seed coats. Altogether, we have shown that domestication altered pea seed development and modified (mostly reduced) the transcripts along with the protein and metabolite composition of the seed coat, especially the content of the compounds involved in defense. We investigated dynamic profiles of selected identified phenolic and flavonoid metabolites across seed development. These compounds usually deteriorated the palatability and processing of the seeds. Our findings further provide resources to study secondary metabolism and strategies for improving the quality of legume seeds which comprise an important part of the human protein diet.

2.

The Complete Sequence and Comparative Analysis of Ape Sex Chromosomes.

Makova, Kateryna D; Pickett, Brandon D; Harris, Robert S; Hartley, Gabrielle A; Cechova, Monika; Pal, Karol; Nurk, Sergey; Yoo, DongAhn; Li, Qiuhui; Hebbar, Prajna; McGrath, Barbara C; Antonacci, Francesca; Aubel, Margaux; Biddanda, Arjun; Borchers, Matthew; Bomberg, Erich; Bouffard, Gerard G; Brooks, Shelise Y; Carbone, Lucia; Carrel, Laura; Carroll, Andrew; Chang, Pi-Chuan; Chin, Chen-Shan; Cook, Daniel E; Craig, Sarah J C; de Gennaro, Luciana; Diekhans, Mark; Dutra, Amalia; Garcia, Gage H; Grady, Patrick G S; Green, Richard E; Haddad, Diana; Hallast, Pille; Harvey, William T; Hickey, Glenn; Hillis, David A; Hoyt, Savannah J; Jeong, Hyeonsoo; Kamali, Kaivan; Kosakovsky Pond, Sergei L; LaPolice, Troy M; Lee, Charles; Lewis, Alexandra P; Loh, Yong-Hwee E; Masterson, Patrick; McCoy, Rajiv C; Medvedev, Paul; Miga, Karen H; Munson, Katherine M; Pak, Evgenia.

bioRxiv ; 2023 Dec 01.

Article in English | MEDLINE | ID: mdl-38077089

ABSTRACT

Apes possess two sex chromosomes: the male-specific Y and the X shared by males and females. The Y chromosome is crucial for male reproduction, with deletions linked to infertility. The X chromosome carries genes vital for reproduction and cognition. Variation in mating patterns and brain function among great apes suggests corresponding differences in their sex chromosome structure and evolution. However, due to their highly repetitive nature and incomplete reference assemblies, ape sex chromosomes have been challenging to study. Here, using the state-of-the-art experimental and computational methods developed for the telomere-to-telomere (T2T) human genome, we produced gapless, complete assemblies of the X and Y chromosomes for five great apes (chimpanzee, bonobo, gorilla, Bornean and Sumatran orangutans) and a lesser ape, the siamang gibbon. These assemblies completely resolved ampliconic, palindromic, and satellite sequences, including the entire centromeres, allowing us to untangle the intricacies of ape sex chromosome evolution. We found that, compared to the X, ape Y chromosomes vary greatly in size and have low alignability and high levels of structural rearrangements. This divergence on the Y arises from the accumulation of lineage-specific ampliconic regions and palindromes (which are shared more broadly among species on the X) and from the abundance of transposable elements and satellites (which have a lower representation on the X). Our analysis of Y chromosome genes revealed lineage-specific expansions of multi-copy gene families and signatures of purifying selection. In summary, the Y exhibits dynamic evolution, while the X is more stable. Finally, mapping short-read sequencing data from >100 great ape individuals revealed the patterns of diversity and selection on their sex chromosomes, demonstrating the utility of these reference assemblies for studies of great ape evolution. These complete sex chromosome assemblies are expected to further inform conservation genetics of nonhuman apes, all of which are endangered species.

3.

The complete sequence of a human Y chromosome.

Rhie, Arang; Nurk, Sergey; Cechova, Monika; Hoyt, Savannah J; Taylor, Dylan J; Altemose, Nicolas; Hook, Paul W; Koren, Sergey; Rautiainen, Mikko; Alexandrov, Ivan A; Allen, Jamie; Asri, Mobin; Bzikadze, Andrey V; Chen, Nae-Chyun; Chin, Chen-Shan; Diekhans, Mark; Flicek, Paul; Formenti, Giulio; Fungtammasan, Arkarachai; Garcia Giron, Carlos; Garrison, Erik; Gershman, Ariel; Gerton, Jennifer L; Grady, Patrick G S; Guarracino, Andrea; Haggerty, Leanne; Halabian, Reza; Hansen, Nancy F; Harris, Robert; Hartley, Gabrielle A; Harvey, William T; Haukness, Marina; Heinz, Jakob; Hourlier, Thibaut; Hubley, Robert M; Hunt, Sarah E; Hwang, Stephen; Jain, Miten; Kesharwani, Rupesh K; Lewis, Alexandra P; Li, Heng; Logsdon, Glennis A; Lucas, Julian K; Makalowski, Wojciech; Markovic, Christopher; Martin, Fergal J; Mc Cartney, Ann M; McCoy, Rajiv C; McDaniel, Jennifer; McNulty, Brandy M.

Nature ; 621(7978): 344-354, 2023 Sep.

Article in English | MEDLINE | ID: mdl-37612512

ABSTRACT

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications [1-3]. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished [4,5]. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a previous assembly of the CHM13 genome [4] and mapped available population variation, clinical variants and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes.

Subjects

Human Y Chromosomes , Genomics , DNA Sequence Analysis , Humans , Base Sequence , Human Y Chromosomes/genetics , Satellite DNA/genetics , Genetic Variation/genetics , Population Genetics , Genomics/methods , Genomics/standards , Heterochromatin/genetics , Multigene Family/genetics , Reference Standards , Genomic Segmental Duplications/genetics , DNA Sequence Analysis/standards , Tandem Repeat Sequences/genetics , Telomere/genetics

4.

Accurate sequencing of DNA motifs able to form alternative (non-B) structures.

Weissensteiner, Matthias H; Cremona, Marzia A; Guiblet, Wilfried M; Stoler, Nicholas; Harris, Robert S; Cechova, Monika; Eckert, Kristin A; Chiaromonte, Francesca; Huang, Yi-Fei; Makova, Kateryna D.

Genome Res ; 33(6): 907-922, 2023 06.

Article in English | MEDLINE | ID: mdl-37433640

ABSTRACT

Approximately 13% of the human genome at certain motifs have the potential to form noncanonical (non-B) DNA structures (e.g., G-quadruplexes, cruciforms, and Z-DNA), which regulate many cellular processes but also affect the activity of polymerases and helicases. Because sequencing technologies use these enzymes, they might possess increased errors at non-B structures. To evaluate this, we analyzed error rates, read depth, and base quality of Illumina, Pacific Biosciences (PacBio) HiFi, and Oxford Nanopore Technologies (ONT) sequencing at non-B motifs. All technologies showed altered sequencing success for most non-B motif types, although this could be owing to several factors, including structure formation, biased GC content, and the presence of homopolymers. Single-nucleotide mismatch errors had low biases in HiFi and ONT for all non-B motif types but were increased for G-quadruplexes and Z-DNA in all three technologies. Deletion errors were increased for all non-B types but Z-DNA in Illumina and HiFi, as well as only for G-quadruplexes in ONT. Insertion errors for non-B motifs were highly, moderately, and slightly elevated in Illumina, HiFi, and ONT, respectively. Additionally, we developed a probabilistic approach to determine the number of false positives at non-B motifs depending on sample size and variant frequency, and applied it to publicly available data sets (1000 Genomes, Simons Genome Diversity Project, and gnomAD). We conclude that elevated sequencing errors at non-B DNA motifs should be considered in low-read-depth studies (single-cell, ancient DNA, and pooled-sample population sequencing) and in scoring rare variants. Combining technologies should maximize sequencing accuracy in future studies of non-B DNA.
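The dependence of false positives on sample size mentioned in this abstract can be illustrated with a simple binomial sketch. This is not the authors' probabilistic model: it assumes independent errors, a single per-base error rate, and that every erroneous read shows the same alternate base (an upper bound). The function names and parameters are hypothetical.

```python
from math import comb


def prob_spurious_variant(depth, error_rate, min_alt_reads):
    """Probability that at least `min_alt_reads` of `depth` reads carry an
    erroneous base at one site, assuming independent errors (binomial tail)."""
    return sum(
        comb(depth, k) * error_rate**k * (1.0 - error_rate) ** (depth - k)
        for k in range(min_alt_reads, depth + 1)
    )


def expected_false_positives(n_sites, n_samples, depth, error_rate, min_alt_reads):
    """Expected number of spurious (site, sample) variant calls.

    Grows linearly with the number of samples, which is why rare variants
    in large cohorts deserve extra scrutiny at error-prone motifs.
    """
    return n_sites * n_samples * prob_spurious_variant(depth, error_rate, min_alt_reads)
```

Under these assumptions, raising the error rate at a motif class (for example, at G-quadruplexes) directly raises the expected count of spurious rare variants for a fixed cohort size.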

Subjects

Z-Form DNA , Nanopores , Humans , Nucleotide Motifs , DNA Sequence Analysis , DNA/genetics , Base Composition , High-Throughput Nucleotide Sequencing

5.

Comprehensive variant discovery in the era of complete human reference genomes.

Cechova, Monika; Miga, Karen H.

Nat Methods ; 20(1): 17-19, 2023 01.

Article in English | MEDLINE | ID: mdl-36635553

Subjects

Genetic Variation , Human Genome , Humans

6.

Separation of Isomeric Tau Phosphopeptides from Alzheimer's Disease Brain by Cyclic Ion Mobility Mass Spectrometry.

Kovác, Andrej; Majerová, Petra; Nytka, Marianna; Cechová, Monika Zajacová; Bednár, Petr; Hájek, Roman; Cooper-Shepherd, Dale A; Muck, Alexander; Lemr, Karel.

J Am Soc Mass Spectrom ; 34(3): 394-400, 2023 Mar 01.

Article in English | MEDLINE | ID: mdl-36706338

ABSTRACT

Alzheimer's disease (AD) is a neurodegenerative disorder of increasing concern. It belongs to diseases termed tauopathies which are characterized by inclusions of abnormally hyperphosphorylated and truncated forms of the protein tau. Studies of tauopathies often focus on detection and characterization of these aberrant tau proteoforms, in particular the phosphorylation sites, which represent a significant analytical challenge, for example when several phosphosites can be present on the same peptide. Such isomers can even be difficult to fully separate chromatographically. Since recently introduced cyclic ion mobility-mass spectrometry can offer different selectivity, we have investigated the closely positioned phosphorylation sites S214, T212, and T217 of a tryptic peptide from the proline-rich region of tau, TPSLPTPPTREPK. The conformational heterogeneity of the isomeric peptides in the gas phase hindered their separation due to their overlapping arrival time distributions. Increasing the resolution of the analysis alone is insufficient to distinguish the peptides in a mixture typical of patient samples. We therefore developed a method based on a combination of collision-induced dissociation, isomeric product ions (m/z 677) mobility separation and post-mobility dissociation to aid in analyzing the isomeric phosphopeptides of tau in diseased brain extract. For all three isomers (T212, S214, and T217), the ion mobility signal of the ion at m/z 677 was still observable at the concentration of 0.1 nmol/L. This work not only offers insights into the phosphorylation of tau protein in AD but also provides an analytical workflow for the characterization of challenging pathological protein modifications in neurodegenerative diseases.

Subjects

Alzheimer Disease , Humans , Brain/metabolism , Mass Spectrometry/methods , Phosphopeptides/chemistry , tau Proteins/isolation & purification , tau Proteins/metabolism

7.

HiC-TE: a computational pipeline for Hi-C data analysis to study the role of repeat family interactions in the genome 3D organization.

Lexa, Matej; Cechova, Monika; Nguyen, Son Hoang; Jedlicka, Pavel; Tokan, Viktor; Kubat, Zdenek; Hobza, Roman; Kejnovsky, Eduard.

Bioinformatics ; 38(16): 4030-4032, 2022 08 10.

Article in English | MEDLINE | ID: mdl-35781332

ABSTRACT

MOTIVATION: The role of repetitive DNA in the 3D organization of the interphase nucleus is a subject of intensive study. In studies of 3D nucleus organization, mutual contacts of various loci can be identified by Hi-C sequencing. Typical analyses use binning of read pairs by location to reduce noise. We use binning by repeat families instead to make similar conclusions about repeat regions. RESULTS: To achieve this, we combined Hi-C data, reference genome data and tools for repeat analysis into a Nextflow pipeline identifying and quantifying the contacts of specific repeat families. As an output, our pipeline produces heatmaps showing contact frequency and circular diagrams visualizing repeat contact localization. Using our pipeline with tomato data, we revealed the preferential homotypic interactions of ribosomal DNA, centromeric satellites and some LTR retrotransposon families and, as expected, little contact between organellar and nuclear DNA elements. While the pipeline can be applied to any eukaryotic genome, results in plants provide better coverage, since the built-in TE-greedy-nester software only detects tandems and LTR retrotransposons. Other repeats can be fed via GFF3 files. This pipeline represents a novel and reproducible way to analyze the role of repetitive elements in the 3D organization of genomes. AVAILABILITY AND IMPLEMENTATION: https://gitlab.fi.muni.cz/lexa/hic-te/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
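The idea of binning Hi-C read pairs by repeat family rather than by genomic location can be sketched as follows. This is a minimal illustration, not the HiC-TE pipeline itself: the input layout (half-open, non-overlapping annotation intervals and read pairs given as mapped positions) and all function names are assumptions.

```python
from bisect import bisect_right
from collections import Counter


def build_index(annotations):
    """annotations: (chrom, start, end, family) tuples, half-open and
    non-overlapping. Returns per-chromosome sorted starts for lookup."""
    index = {}
    for chrom, start, end, family in sorted(annotations):
        starts, items = index.setdefault(chrom, ([], []))
        starts.append(start)
        items.append((end, family))
    return index


def family_at(index, chrom, pos):
    """Repeat family covering `pos` on `chrom`, or None if unannotated."""
    if chrom not in index:
        return None
    starts, items = index[chrom]
    i = bisect_right(starts, pos) - 1
    if i >= 0 and pos < items[i][0]:
        return items[i][1]
    return None


def count_family_contacts(index, read_pairs):
    """Count contacts per unordered pair of repeat families.

    read_pairs: iterable of ((chrom1, pos1), (chrom2, pos2)).
    Pairs with an end outside any annotated repeat are skipped.
    """
    counts = Counter()
    for (c1, p1), (c2, p2) in read_pairs:
        f1 = family_at(index, c1, p1)
        f2 = family_at(index, c2, p2)
        if f1 is None or f2 is None:
            continue
        counts[tuple(sorted((f1, f2)))] += 1
    return counts
```

The resulting family-by-family counts are the kind of matrix a contact-frequency heatmap could be drawn from; normalization by family abundance is left out of this sketch.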

Subjects

Data Analysis , Genomics , Genomics/methods , Genome , Software , Retroelements

8.

Satellite DNAs and human sex chromosome variation.

Cechova, Monika; Miga, Karen H.

Semin Cell Dev Biol ; 128: 15-25, 2022 08.

Article in English | MEDLINE | ID: mdl-35644878

ABSTRACT

Satellite DNAs are present on every chromosome in the cell and are typically enriched in repetitive, heterochromatic parts of the human genome. Sex chromosomes represent a unique genomic and epigenetic context. In this review, we first report what is known about satellite DNA biology on human X and Y chromosomes, including repeat content and organization, as well as satellite variation in typical euploid individuals. Then, we review sex chromosome aneuploidies that are among the most common types of aneuploidies in the general population, and are better tolerated than autosomal aneuploidies. This is demonstrated also by the fact that aging is associated with the loss of the X, and especially the Y chromosome. In addition, supernumerary sex chromosomes enable us to study general processes in a cell, such as analyzing heterochromatin dosage (i.e. additional Barr bodies and long heterochromatin arrays on Yq) and their downstream consequences. Finally, genomic and epigenetic organization and regulation of satellite DNA could influence chromosome stability and lead to aneuploidy. In this review, we argue that the complete annotation of satellite DNA on sex chromosomes in human, and especially in centromeric regions, will aid in explaining the prevalence and the consequences of sex chromosome aneuploidies.

Subjects

Satellite DNA , Heterochromatin , Aneuploidy , Centromere/genetics , Human Chromosomes , Satellite DNA/genetics , Heterochromatin/genetics , Humans , Sex Chromosomes/genetics

9.

The loss of polyphenol oxidase function is associated with hilum pigmentation and has been selected during pea domestication.

Balarynová, Jana; Klcová, Barbora; Sekaninová, Jana; Kobrlová, Lucie; Cechová, Monika Zajacová; Krejcí, Petra; Leonova, Tatiana; Gorbach, Daria; Ihling, Christian; Smrzová, Lucie; Trnený, Oldrich; Frolov, Andrej; Bednár, Petr; Smýkal, Petr.

New Phytol ; 235(5): 1807-1821, 2022 09.

Article in English | MEDLINE | ID: mdl-35585778

ABSTRACT

Seed coats serve as protective tissue to the enclosed embryo. As well as mechanical there are also chemical defence functions. During domestication, the property of the seed coat was altered including the removal of the seed dormancy. We used a range of genetic, transcriptomic, proteomic and metabolomic approaches to determine the function of the pea seed polyphenol oxidase (PPO) gene. Sequencing analysis revealed one nucleotide insertion or deletion in the PPO gene, with the functional PPO allele found in all wild pea samples, while most cultivated peas have one of the three nonfunctional ppo alleles. PPO functionality cosegregates with hilum pigmentation. PPO gene and protein expression, as well as enzymatic activity, was downregulated in the seed coats of cultivated peas. The functionality of the PPO gene relates to the oxidation and polymerisation of gallocatechin in the seed coat. Additionally, imaging mass spectrometry supports the hypothesis that hilum pigmentation is conditioned by the presence of both phenolic precursors and sufficient PPO activity. Taken together these results indicate that the nonfunctional polyphenol oxidase gene has been selected during pea domestication, possibly due to better seed palatability or seed coat visual appearance.

Subjects

Catechol Oxidase , Catechol Oxidase/genetics , Catechol Oxidase/metabolism , Domestication , /metabolism , Pigmentation , Proteomics , Seeds/genetics , Seeds/metabolism

10.

Combination of electronically driven micromanipulation with laser desorption ionization mass spectrometry - The unique tool for analysis of seed coat layers and revealing the mystery of seed dormancy.

Krejcí, Petra; Cechová, Monika Zajacová; Nádvorníková, Jana; Barták, Petr; Kobrlová, Lucie; Balarynová, Jana; Smýkal, Petr; Bednár, Petr.

Talanta ; 242: 123303, 2022 May 15.

Article in English | MEDLINE | ID: mdl-35183978

ABSTRACT

Electronically driven micromanipulation (EDM) with microscopic control was used as a novel tool for sample preparation prior to direct (matrix assisted) laser desorption/ionization mass spectrometric ((MA)LDI-MS) analysis of mature pea seed coat composition in defined layers. Microscissors were used for seed coat fragment shape adjustment, microtweezers for sample holding and "microjackhammer" Milling Pro for precise mechanical removing of cell layers in defined depths (2, 5 or 10 µm). These procedures circumvent the application of embedding media or enzymatic digestion of seed coat that would complicate mass spectra interpretation (presence of matrix signals, analyte signals enhancement or attenuation) and represent alternative for 3D metabolites profiling. In addition, microinjector was used to apply a solution on intact or micropeeled seed coat surface in nano-volumes, i.e. MALDI matrix and/or lithium salt, that provide improvement of signal of sugars. Utilization of EDM enabled optimization of matrix composition on a single small fragment of seed coat overcoming thus problems with biological (seed to seed) variability. LDI-MS data were studied by multivariate statistical analysis and significant metabolites in particular layers of seed coats were identified. Normalized intensities of signals (NS) of long-chain hydroxylated fatty acids (HLFA) on intact dormant pea genotype (JI64) seed coats were significantly higher than in their counterparts treated by micropeeling confirming HLFA accumulation in outermost layers (cutin). Fatty acids distribution differences between dormant and non-dormant genotypes were explored in detail. On the other hand, NS of sugar chains and particular polyphenols were significantly higher in micropeeled seed coats of studied dormant and non-dormant genotypes than in intact seed coats. Furthermore, combination of EDM with mass spectrometry imaging (MSI) allowed vertical profiling of metabolites in hilum (a place of former attachment of seed to maternal plant) and comparison of its composition with surrounding tissues. The obtained results contribute to the understanding of relations between seed coat chemical composition and physical seed dormancy.

Subjects

Plant Dormancy , Seeds , Lasers , Micromanipulation , Matrix-Assisted Laser Desorption-Ionization Mass Spectrometry/methods

11.

Ten simple rules for biologists initiating a collaboration with computer scientists.

Cechova, Monika.

PLoS Comput Biol ; 16(10): e1008281, 2020 10.

Article in English | MEDLINE | ID: mdl-33091012

Subjects

Computational Biology , Cooperative Behavior , Biology/education , Biology/organization & administration , Communication , Computational Biology/education , Computational Biology/methods , Computational Biology/organization & administration , Humans , Informatics

12.

Dynamic evolution of great ape Y chromosomes.

Cechova, Monika; Vegesna, Rahulsimham; Tomaszkiewicz, Marta; Harris, Robert S; Chen, Di; Rangavittal, Samarth; Medvedev, Paul; Makova, Kateryna D.

Proc Natl Acad Sci U S A ; 117(42): 26273-26280, 2020 10 20.

Article in English | MEDLINE | ID: mdl-33020265

ABSTRACT

The mammalian male-specific Y chromosome plays a critical role in sex determination and male fertility. However, because of its repetitive and haploid nature, it is frequently absent from genome assemblies and remains enigmatic. The Y chromosomes of great apes represent a particular puzzle: their gene content is more similar between human and gorilla than between human and chimpanzee, even though human and chimpanzee share a more recent common ancestor. To solve this puzzle, here we constructed a dataset including Ys from all extant great ape genera. We generated assemblies of bonobo and orangutan Ys from short and long sequencing reads and aligned them with the publicly available human, chimpanzee, and gorilla Y assemblies. Analyzing this dataset, we found that the genus Pan, which includes chimpanzee and bonobo, experienced accelerated substitution rates. Pan also exhibited elevated gene death rates. These observations are consistent with high levels of sperm competition in Pan. Furthermore, we inferred that the great ape common ancestor already possessed multicopy sequences homologous to most human and chimpanzee palindromes. Nonetheless, each species also acquired distinct ampliconic sequences. We also detected increased chromatin contacts between and within palindromes (from Hi-C data), likely facilitating gene conversion and structural rearrangements. Our results highlight the dynamic mode of Y chromosome evolution and open avenues for studies of male-specific dispersal in endangered great ape species.

Subjects

Hominidae/genetics , Y Chromosome/genetics , Animals , Biological Evolution , Molecular Evolution , Gene Conversion , Gorilla gorilla/genetics , Humans , Pan paniscus/genetics , Pan troglodytes/genetics , Pongo/genetics , DNA Sequence Analysis

13.

Probably Correct: Rescuing Repeats with Short and Long Reads.

Cechova, Monika.

Genes (Basel) ; 12(1), 2020 12 31.

Article in English | MEDLINE | ID: mdl-33396198

ABSTRACT

Ever since the introduction of high-throughput sequencing following the human genome project, assembling short reads into a reference of sufficient quality posed a significant problem as a large portion of the human genome (an estimated 50-69%) is repetitive. As a result, a sizable proportion of sequencing reads is multi-mapping, i.e., without a unique placement in the genome. The two key parameters for whether or not a read is multi-mapping are the read length and genome complexity. Long reads are now able to span difficult, heterochromatic regions, including full centromeres, and characterize chromosomes from "telomere to telomere". Moreover, identical reads or repeat arrays can be differentiated based on their epigenetic marks, such as methylation patterns, aiding in the assembly process. This is despite the fact that long reads still contain a modest percentage of sequencing errors, disorienting the aligners and assemblers both in accuracy and speed. Here, I review the proposed and implemented solutions to the repeat resolution and the multi-mapping read problem, as well as the downstream consequences of reference choice, repeat masking, and proper representation of sex chromosomes. I also consider the forthcoming challenges and solutions with regards to long reads, where we expect the shift from the problem of repeat localization within a single individual to the problem of repeat positioning within pangenomes.
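The relationship between read length, genome complexity, and multi-mapping described in this abstract can be illustrated with a toy calculation. The sketch below is an assumption-laden simplification, not a method from the review: it treats reads as error-free substrings, considers a single strand only (no reverse complement), and uses a hypothetical function name.

```python
from collections import Counter


def unique_fraction(genome, read_length):
    """Fraction of error-free reads of `read_length` that occur exactly once
    in `genome`, i.e. reads that would have a unique placement."""
    windows = [
        genome[i:i + read_length]
        for i in range(len(genome) - read_length + 1)
    ]
    counts = Counter(windows)
    return sum(counts[w] == 1 for w in windows) / len(windows)
```

On a repetitive toy sequence such as `"ACACACGT"`, the uniquely placed fraction rises as the read length increases, which mirrors why long reads help resolve repeats.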

Subjects

Chromosome Mapping/methods , Human Genome , Microsatellite Repeats , Sex Chromosomes/chemistry , Centromere/chemistry , Computational Biology/methods , DNA Methylation , Genome Size , Humans , Telomere/chemistry

14.

High Satellite Repeat Turnover in Great Apes Studied with Short- and Long-Read Technologies.

Cechova, Monika; Harris, Robert S; Tomaszkiewicz, Marta; Arbeithuber, Barbara; Chiaromonte, Francesca; Makova, Kateryna D.

Mol Biol Evol ; 36(11): 2415-2431, 2019 Nov 01.

Article in English | MEDLINE | ID: mdl-31273383

ABSTRACT

Satellite repeats are a structural component of centromeres and telomeres, and in some instances, their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover in great apes, we studied satellite repeats of unit sizes up to 50 bp in human, chimpanzee, bonobo, gorilla, and Sumatran and Bornean orangutans, using unassembled short and long sequencing reads. The density of satellite repeats, as identified from accurate short reads (Illumina), varied greatly among great ape genomes. These were dominated by a handful of abundant repeated motifs, frequently shared among species, which formed two groups: 1) the (AATGG)n repeat (critical for heat shock response) and its derivatives; and 2) subtelomeric 32-mers involved in telomeric metabolism. Using the densities of abundant repeats, individuals could be classified into species. However, clustering did not reproduce the accepted species phylogeny, suggesting rapid repeat evolution. Several abundant repeats were enriched in males versus females; using Y chromosome assemblies or Fluorescent In Situ Hybridization, we validated their location on the Y. Finally, applying a novel computational tool, we identified many satellite repeats completely embedded within long Oxford Nanopore and Pacific Biosciences reads. Such repeats were up to 59 kb in length and consisted of perfect repeats interspersed with other similar sequences. Our results based on sequencing reads generated with three different technologies provide the first detailed characterization of great ape satellite repeats, and open new avenues for exploring their functions.

15.

Noise-cancelling repeat finder: uncovering tandem repeats in error-prone long-read sequencing data.

Harris, Robert S; Cechova, Monika; Makova, Kateryna D.

Bioinformatics ; 35(22): 4809-4811, 2019 11 01.

Article in English | MEDLINE | ID: mdl-31290946

ABSTRACT

SUMMARY: Tandem DNA repeats can be sequenced with long-read technologies, but cannot be accurately deciphered due to the lack of computational tools taking high error rates of these technologies into account. Here we introduce Noise-Cancelling Repeat Finder (NCRF) to uncover putative tandem repeats of specified motifs in noisy long reads produced by Pacific Biosciences and Oxford Nanopore sequencers. Using simulations, we validated the use of NCRF to locate tandem repeats with motifs of various lengths and demonstrated its superior performance as compared to two alternative tools. Using real human whole-genome sequencing data, NCRF identified long arrays of the (AATGG)n repeat involved in heat shock stress response. AVAILABILITY AND IMPLEMENTATION: NCRF is implemented in C, supported by several python scripts, and is available in bioconda and at https://github.com/makovalab-psu/NoiseCancellingRepeatFinder. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
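To illustrate what finding a tandem repeat of a specified motif in a noisy read involves, the sketch below runs a plain Smith-Waterman local alignment of a read against a perfect tandem array of the motif. It is not NCRF's implementation and does not use its scoring; the function name, the scoring parameters, and the quadratic-time approach are assumptions chosen for brevity.

```python
def best_noisy_repeat(read, motif, match=1, mismatch=-1, gap=-1):
    """Best local alignment of `read` against a perfect tandem array of `motif`.

    Returns (score, start, end), where read[start:end] is the aligned segment.
    Substitutions and indels are tolerated at the cost of the given penalties.
    """
    # A perfect array long enough to cover the whole read at any phase.
    target = motif * (len(read) // len(motif) + 2)
    m = len(target)
    prev_score = [0] * (m + 1)
    prev_start = [0] * (m + 1)  # read offset where each alignment began
    best = (0, 0, 0)
    for i in range(1, len(read) + 1):
        cur_score = [0] * (m + 1)
        cur_start = [i] * (m + 1)
        for j in range(1, m + 1):
            s = match if read[i - 1] == target[j - 1] else mismatch
            candidates = (
                (prev_score[j - 1] + s, prev_start[j - 1]),  # (mis)match
                (prev_score[j] + gap, prev_start[j]),        # extra read base
                (cur_score[j - 1] + gap, cur_start[j - 1]),  # missing read base
            )
            score, start = max(candidates, key=lambda c: c[0])
            if score <= 0:
                # Local alignment: restart after this read position.
                score, start = 0, i
            cur_score[j], cur_start[j] = score, start
            if score > best[0]:
                best = (score, start, i)
        prev_score, prev_start = cur_score, cur_start
    return best
```

For example, a read containing four clean copies of `AATGG` between unrelated flanks yields the span of those four copies, and a single substitution inside the array lowers the score without splitting the hit.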

Subjects

High-Throughput Nucleotide Sequencing , Human Genome , Humans , Nanopores , DNA Sequence Analysis , Software , Tandem Repeat Sequences

16.

Correction to: Utilization of atmospheric solids analysis probe mass spectrometry for analysis of fatty acids on seed surface.

Cechová, Monika; Hradilová, Iveta; Smýkal, Petr; Barták, Petr; Bednár, Petr.

Anal Bioanal Chem ; 411(17): 3989-3990, 2019 07.

Article in English | MEDLINE | ID: mdl-31089785

ABSTRACT

The authors would like to call the reader's attention to the following correction in the section "Semiquantitative analysis", page 1176, of the original publication.

17.

Utilization of atmospheric solids analysis probe mass spectrometry for analysis of fatty acids on seed surface.

Cechová, Monika; Hradilová, Iveta; Smýkal, Petr; Barták, Petr; Bednár, Petr.

Anal Bioanal Chem ; 411(6): 1169-1180, 2019 Feb.

Article in English | MEDLINE | ID: mdl-30617396

ABSTRACT

Atmospheric solids analysis probe mass spectrometry (ASAP-MS) was used for the first time for direct surface analysis of plant material. It can be readily used for surface analysis of whole and intact pea seeds and their seed coats, and for the study of the profile of fatty acids on the outer surface. Furthermore, ASAP-MS in combination with multivariate statistics allowed classification of pea genotypes with respect to physical dormancy and investigation of related biological markers. Hexacosanoic and octacosanoic acids were suggested to be important markers likely influencing water transport through the seed coat into the embryo (with the highest significance for dormant L100 genotype). ASAP-MS provided higher selectivity and better signal of fatty acids compared to (MA)LDI-MS (laser desorption ionization mass spectrometry either matrix free or matrix assisted) providing on the other hand spatial distribution information and results obtained by both methods are mutually supportive. The developed ASAP-MS method and obtained results can be widely utilized in biological, food, and agricultural research.

Subjects

Fatty Acids/analysis , Mass Spectrometry/instrumentation , Seeds/chemistry , Equipment Design , Matrix-Assisted Laser Desorption-Ionization Mass Spectrometry/instrumentation

18.

Long-read sequencing technology indicates genome-wide effects of non-B DNA on polymerization speed and error rate.

Guiblet, Wilfried M; Cremona, Marzia A; Cechova, Monika; Harris, Robert S; Kejnovská, Iva; Kejnovsky, Eduard; Eckert, Kristin; Chiaromonte, Francesca; Makova, Kateryna D.

Genome Res ; 28(12): 1767-1778, 2018 12.

Article in English | MEDLINE | ID: mdl-30401733

ABSTRACT

DNA conformation may deviate from the classical B-form in ~13% of the human genome. Non-B DNA regulates many cellular processes; however, its effects on DNA polymerization speed and accuracy have not been investigated genome-wide. Such an inquiry is critical for understanding neurological diseases and cancer genome instability. Here, we present the first simultaneous examination of DNA polymerization kinetics and errors in the human genome sequenced with Single-Molecule Real-Time (SMRT) technology. We show that polymerization speed differs between non-B and B-DNA: It decelerates at G-quadruplexes and fluctuates periodically at disease-causing tandem repeats. Analyzing polymerization kinetics profiles, we predict and validate experimentally non-B DNA formation for a novel motif. We demonstrate that several non-B motifs affect sequencing errors (e.g., G-quadruplexes increase error rates), and that sequencing errors are positively associated with polymerase slowdown. Finally, we show that highly divergent G4 motifs have pronounced polymerization slowdown and high sequencing error rates, suggesting similar mechanisms for sequencing errors and germline mutations.

Subjects

DNA/chemistry , Genomics , High-Throughput Nucleotide Sequencing , Nucleic Acid Conformation , DNA Sequence Analysis , DNA Replication , G-Quadruplexes , Genomics/methods , Genomics/standards , High-Throughput Nucleotide Sequencing/methods , High-Throughput Nucleotide Sequencing/standards , Humans , Kinetics , Mutation , Nucleotide Motifs , Reproducibility of Results , DNA Sequence Analysis/methods

19.

RecoverY: k-mer-based read classification for Y-chromosome-specific sequencing and assembly.

Rangavittal, Samarth; Harris, Robert S; Cechova, Monika; Tomaszkiewicz, Marta; Chikhi, Rayan; Makova, Kateryna D; Medvedev, Paul.

Bioinformatics ; 34(7): 1125-1131, 2018 04 01.

Article in English | MEDLINE | ID: mdl-29194476

ABSTRACT

Motivation: The haploid mammalian Y chromosome is usually under-represented in genome assemblies due to high repeat content and low depth due to its haploid nature. One strategy to ameliorate the low coverage of Y sequences is to experimentally enrich Y-specific material before assembly. As the enrichment process is imperfect, algorithms are needed to identify putative Y-specific reads prior to downstream assembly. A strategy that uses k-mer abundances to identify such reads was used to assemble the gorilla Y. However, the strategy required the manual setting of key parameters, a time-consuming process leading to sub-optimal assemblies. Results: We develop a method, RecoverY, that selects Y-specific reads by automatically choosing the abundance level at which a k-mer is deemed to originate from the Y. This algorithm uses prior knowledge about the Y chromosome of a related species or known Y transcript sequences. We evaluate RecoverY on both simulated and real data, for human and gorilla, and investigate its robustness to important parameters. We show that RecoverY leads to a vastly superior assembly compared to alternate strategies of filtering the reads or contigs. Compared to the preliminary strategy used by Tomaszkiewicz et al., we achieve a 33% improvement in assembly size and a 20% improvement in the NG50, demonstrating the power of automatic parameter selection. Availability and implementation: Our tool RecoverY is freely available at https://github.com/makovalab-psu/RecoverY. Contact: kmakova@bx.psu.edu or pashadag@cse.psu.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Subjects

High-Throughput Nucleotide Sequencing/methods , DNA Sequence Analysis/methods , Software , Y Chromosome , Algorithms , Animals , Mammalian Chromosomes , Genomics/methods , Gorilla gorilla/genetics , Humans , Male , Mammals
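
The k-mer abundance idea described in the RecoverY abstract above can be illustrated with a minimal sketch. This is not the RecoverY implementation (see the repository linked in the abstract for that): the function names, the fixed `min_abundance` threshold, and the `min_fraction` cutoff are all hypothetical, and the sketch takes the threshold as a manual parameter, which is exactly the step RecoverY is reported to automate.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every k-mer (substring of length k) of seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def count_kmers(reads, k):
    """Count how often each k-mer occurs across all reads."""
    counts = Counter()
    for read in reads:
        counts.update(kmers(read, k))
    return counts


def select_reads(reads, k, min_abundance, min_fraction=0.5):
    """Keep reads in which at least min_fraction of the k-mers are abundant.

    A k-mer is treated as abundant when it occurs at least min_abundance
    times in the whole read set. Both thresholds are illustrative
    assumptions, set by hand here.
    """
    counts = count_kmers(reads, k)
    kept = []
    for read in reads:
        total = len(read) - k + 1
        if total <= 0:
            continue  # read is shorter than k, so it has no k-mers
        abundant = sum(1 for km in kmers(read, k) if counts[km] >= min_abundance)
        if abundant / total >= min_fraction:
            kept.append(read)
    return kept


if __name__ == "__main__":
    toy_reads = ["ACGTACGT", "ACGTACGA", "TTTTGGGC"]
    print(select_reads(toy_reads, k=4, min_abundance=2))
```

In an enriched library, k-mers from the target chromosome would be expected to appear at higher abundance than background k-mers, so the toy filter keeps the two reads that share repeated k-mers and drops the third.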

20.

Towards Better Understanding of Pea Seed Dormancy Using Laser Desorption/Ionization Mass Spectrometry.

Cechová, Monika; Válková, Markéta; Hradilová, Iveta; Janská, Anna; Soukup, Ales; Smýkal, Petr; Bednár, Petr.

Int J Mol Sci ; 18(10)2017 Oct 21.

Article in English | MEDLINE | ID: mdl-29065445

ABSTRACT

Seed coats of six pea genotypes contrasting in dormancy were studied by laser desorption/ionization mass spectrometry (LDI-MS). Multivariate statistical analysis discriminated dormant and non-dormant seeds in the mature dry state. Separation between dormant and non-dormant types was observed even though the important markers of particular dormant genotypes differ from each other. Normalized signals of long-chain hydroxylated fatty acids (HLFA) in dormant JI64 genotype seed coats were significantly higher than in other genotypes. These compounds seem to be important markers likely influencing JI64 seed imbibition and germination. The importance of HLFA was supported by a study of recombinant inbred lines (JI64xJI92) contrasting in dormancy but similar in other seed properties. Furthermore, HLFA distribution in the seed coat was studied by mass spectrometry imaging. HLFA contents in the strophiole and hilum are significantly lower than in other parts, indicating their role in water uptake. Results from LDI-MS experiments are useful in understanding the mechanism of (physical) dormancy (first phases of germination) and properties related to food processing technologies (e.g., seed treatment by cooking).

Subjects

Fatty Acids/analysis , Plant Dormancy , Seeds/physiology , Mass Spectrometry , Seeds/metabolism
